N-gram Distributions in Texts as Proxy for Textual Fingerprints

نویسنده

  • Carl VOGEL
چکیده

Recent experiments using mainly character unigram distributions in authorship attribution tasks are discussed. Results so far indicate efficacy in similarity judgements seemingly good enough for ‘balance of probabilities’ standards, but not yet for proof ‘beyond reasonable doubt’.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Competition of Discourses in Journalistic Translation: Diplomatic Negotiations in Focus

We sought to understand  whether,  how, and  why the translated journalistic texts  related  to  the  Iranian  nuclear  negotiations  were manipulated. To this end,  we  monitored  a news agency’s Webpage in a time span of 46 days that began 3 days before Almaty I nuclear talks and ended  3  days  after  Almaty  II  talks.  Monitoring  resulted  in  a  corpus  made  up  of  36  target  texts  p...

متن کامل

The Effect of Textual Enhancement Types on EFL Learners’ Grammatical Awareness of Simple and Complex Structures

The purpose of this study was to investigate the effect of textual enhancement types on English as Foreign Language ( EFL ) learn er s ’ grammatical awareness of simple and complex structures. A nu m- ber of 104 learners from Ayandegan Institute, in Tehran, participated in this study. To homogenize the samples, the researchers administered a placement Michigan Test of English language to the p...

متن کامل

JU_CSE_NLP: Language Independent Cross-lingual Textual Entailment System

This article presents the experiments carried out at Jadavpur University as part of the participation in Cross-lingual Textual Entailment for Content Synchronization (CLTE) of task 8 @ Semantic Evaluation Exercises (SemEval-2012). The work explores cross-lingual textual entailment as a relation between two texts in different languages and proposes different measures for entailment decision in a...

متن کامل

A Sociopragmatic Analysis of Textual Metadiscourse Markers in English and Persian Scientific Texts

This study aimed to explore the function and frequency of textual metadiscourse markers (MDMs) in scientific English and Persian texts. Based on the qualitative and quantitative analysis of textual markers characterizing the selected genre, four different textbooks, two written in English and two in Persian were analyzed to identify the textual metadiscourse categories (including logical marker...

متن کامل

A Contrastive Study of Theme in English and Azerbaijani Turkish Fictional Texts

Thematisationis one of the troublesome areas both for translation purposes from or into English and also for learning EFL. The main reason for the problem lies in the fact that usually different languages structure thematisation in different ways. Therefore, the present research is an attempt to investigate contrastively: experiential (topical), interpersonal and textual themes in a sample of A...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006